Implementing Linguistic Query Languages Using LoToS
نویسندگان
چکیده
A linguistic database is a collection of texts where sentences and words are annotated with linguistic information, such as part of speech, morphology, and syntactic sentence structure. While early linguistic databases focused on word annotations, and later also on parse-trees of sentences (so-called treebanks), the recent years have seen a growing interest in richly annotated corpora of historic texts that include not only syntactic annotations but further complex annotations, such as alignments between related text layers. This raises the issue of efficiently querying such complex structured linguistic databases. We present a generic approach for defining domain-specific query languages that we use in developing a query language for richly annotated historic corpora. In our approach, a query language is defined as a set of predicates. A query in form of a logic rule is translated by our LoToS query compiler into a single, possibly deeply nested SQL query. In contrast to previous approaches, the annotation structures that can be queried need not be trees but can also form DAGs, or, for a restricted class of recursive queries, arbitrary graphs. To this end, LoToS offers an operator for computing transitive closures using the recursive capabilities of modern database systems. We believe that this is the first approach to use modern SQL capabilities for evaluating recursive predicates in logic-based query languages. Chapter
منابع مشابه
Instantiation and Implementation of a Corpus Query Lingua Franca Master Thesis by
The present thesis introduces KoralQuery, a protocol for the generic representation of queries to linguistic corpora. KoralQuery defines a set of types and operations which serve as abstract representations of linguistic entities and configurations. By combining these types and operations in a nested structure, the protocol may express linguistic structures of arbitrary complexity. It achieves ...
متن کاملFuzzy linguistic logic programming and its applications
The paper introduces fuzzy linguistic logic programming, which is a combination of fuzzy logic programming, introduced by P. Vojtáš, and hedge algebras in order to facilitate the representation and reasoning on human knowledge expressed in natural languages. In fuzzy linguistic logic programming, truth values are linguistic ones, e.g., VeryTrue, VeryProbablyTrue, and LittleFalse, taken from a h...
متن کاملDeveloping Coherent Multiagent Systems using JAFMAS
The Java-based Agent Framework for Multi-Agent Systems (JAFMAS) provides a generic methodology for developing speech-act based multiagent systems (MAS) along with a set of classes to support implementing these agents in Java. The methodology follows five stages: (i) agent identification, (ii) definition of each agent’s conversations, (iii) determining the rules governing each agent’s conversati...
متن کاملA Wish List for the Behaviour Part of LOTOSVersion 2 . 0 Hubert GARAVEL
This document supersedes a previous Afnor contribution dated from June 1994 and entitled \Six improvements to the process part of Lotos". We propose nineteen changes, which aaect the syntax, static semantics and/or dynamic semantics of the behaviour part of Lotos. These changes aim at solving several problems found in Lotos and making the behaviour part of E-Lotos expressive, simple, symmetric ...
متن کاملQuerying Linguistic Trees
Large databases of linguistic annotations are used for testing linguistic hypotheses and for training language processing models. These linguistic annotations are often syntactic or prosodic in nature, and have a hierarchical structure. Query languages are used to select particular structures of interest, or to project out large slices of a corpus for external analysis. Existing languages suffe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005